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Abstract —This paper presents a systematic approach to exploit 
TV white space (TVWS) for device-to-device (D2D) commu¬ 
nications with the aid of the existing cellular infrastructure. 
The goal is to build a location-specific TVWS database, which 
provides a look-up table service for any D2D link to determine 
its maximum permitted emission power (MPEP) in an unlicensed 
digital TV (DTV) band. To achieve this goal, the idea of mobile 
crowd sensing is firstly introduced to collect active spectrum 
measurements from massive personal mobile devices. Consid¬ 
ering the incompleteness of crowd measurements, we formulate 
the problem of unknown measurements recovery as a matrix 
completion problem and apply a powerful fixed point continu¬ 
ation algorithm to reconstruct the unknown elements from the 
known elements. By joint exploitation of the big spectrum data 
in its vicinity, each cellular base station further implements a 
nonlinear support vector machine algorithm to perform irregular 
coverage boundary detection of a licensed DTV transmitter. 
With the knowledge of the detected coverage boundary, an 
opportunistic spatial reuse algorithm is developed for each D2D 
link to determine its MPEP. Simulation results show that the 
proposed approach can successfully enable D2D communications 
in TVWS while satisfying the interference constraint from the 
licensed DTV services. In addition, to our best knowledge, this 
is the first try to explore and exploit TVWS inside the DTV 
protection region resulted from the shadowing effect. Potential 
application scenarios include communications between internet 
of vehicles in the underground parking, D2D communications in 
hotspots such as subway, game stadiums, and airports, etc. 

Index Terms —Mobile crowd sensing, device-to-device commu¬ 
nications, TV white space, cognitive radio, geolocation database 


1. Introduction 

R ecently, we have witnessed a dramatic growth in 
wireless data traffic. To provide higher capacity, re¬ 
searchers have been exploring the next generation wireless 
communication systems 0 - 0 . However, one crucial imped¬ 
iment is the shortage of radio spectrum 0 0 To develop 
efficient solutions, several approaches have been suggested, 
e.g., spectrum extension, network densification, and spectrum 
efficiency improvement, etc |[^. 
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The integration of device-to-device (D2D) communications 
into cellular networks is a promising paradigm to improve 
spectrum efficiency. D2D communication enables mobile de¬ 
vices in proximity to establish a direct link and to reuse the 
cellular spectrum. The advantages are manifold: offloading 
cellular traffic, eliminating coverage holes, improving spatial 
reuse, reducing battery consumption, and thereby enabling 
new services 0 (To) Provided that D2D communications 
share the same licensed spectrum resource with the co-located 
cellular users, majority of previous studies have focused on 
addressing the mutual interference between them GD-GI). 

Differently, in this paper we focus on exploring unlicensed 
TV white space (TVWS) for D2D communications in cellular 
networks. TVWS refers to TV bands that are not used by 
any DTV services at a particular time in a particular geo¬ 
graphic area GD- TVWS is also known as digital dividend, 
which emerges from the transition from analog to digital TV 
transmission |T^ . With the transition, large portions of the 
UHE spectrum (e.g., 512-608 MHz and 614-698 MHz in 
USA, 470-550 MHz and 614-782 MHz in UK) have been 
released for unlicensed devices as long as they do not create 
harmful interference to any licensed services. The TVWS in 
the released UHE spectrum is quite attractive since it has 
favorable propagation and building penetration characteristics. 

However, the development of D2D communications in 
TVWS poses critical challenges. Eirst of all, strict interference 
constraint should be met to protect the normal operation of li¬ 
censed DTV services GZ) |T^ . To develop efficient solutions, 
two main approaches have been suggested: i) spectrum sensing 
and ii) geolocation database. In the spectrum sensing approach, 
the availability of TVWS at a given location is modeled 
as a threshold-based hypothesis test |T9| . This approach, 
however, suffers from the hidden node problem because of 
shadowing [ [2Q| . On the contrary, according to |[^-[[26|, the 
geolocation database approach seems to provide a technically 
feasible and commercially viable solution in the near future. 
This approach provides a service that an unlicensed device can 
inquire the TVWS availability from a geolocation database, 
which predicts the availability of TVWS at any location using 
propagation modeling with high-resolution terrain data | [27| , 
One key limitation of this approach is that the accuracy 
of the TVWS availability provided by the database depends 
highly on the quality of the propagation modeling and the 
granularity of the terrain data | [29| , | [30| . 

Motivated by the opportunities and challenges described 
above, in this paper, we propose a novel mobile crowd sensing- 
driven geolocation database approach to explore TVWS for 
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D2D communications. The goal is to build a location-specific 
TVWS database, which provides a look-up table service for 
any D2D link to determine its maximum permitted emission 
power (MPEP) in an unlicensed digital TV (DTV) band. 
Before presenting details of the proposed approach, some key 
design rationales are stated as follows: 

(i) Using TVWS for D2D communications in cellular net¬ 
works is promising in terms of both technical and commercial 
viability. Just as a container filled with rocks still has room for 
sand, the geographical area not covered by large-scale DTV 
systems could be reused by small-scale D2D communication^ 
to improve the overall spatial spectrum utilization. Moreover, 
in comparison to other license-exempt spectral bands such as 
2.4 GHz and 5 GHz, the superior propagation properties of TV 
spectrum allows a higher transmission range at much lower 
energy requirements. Furthermore, instead of introducing a 
new operator to invest in a new infrastructure, it can be a 
promising business model for the cellular service providers to 
exploit TVWS, since they can offer more bandwidth to mobile 
users/devices by accessing extra spectrum with an already- 
deployed cellular infrastructure. 

(ii) There are much more spatial reuse opportunities for 
unlicensed devices in TV spectrum than we have already 
recognized. After carefully studying the recent released DTV 
coverage maps by FCC m and Ofcom p2| , we observe 
that for a given DTV transmitter, there are plenty of spatial 
coverage holes resulted from the signal attenuation due to 
either distance-related path loss or shadowing effect of irreg¬ 
ular terrain or buildings. Traditionally, shadowing has been 
considered as a physical barrier to reliable spectrum sensing 
because of the hidden node problem p0| . However, from 
a perspective of spatial reuse, we see that shadowing can 
greatly improve the isolation between the small-scale D2D 
communications and large-scale DTV services, which thus 
increases the flexibility in local spectrum usage and benefits 
the D2D communications by providing more white spaces. 

(hi) Parallel with the existing model-based approaches, 
data-driven approaches promise a new paradigm to explore 
TVWS availability. The popularity of various mobile wireless 
devices (e.g., smart phones, tablets, and in-vehicle sensors) 
equipped with programmable and powerful sensors makes it 
a good potential to learn the TVWS availability from massive 
active spectrum measurements or big spectrum data, which can 
serve as an alternative of the current sophisticated propagation 
models in the geolocation database approach (see, e.g., p7|). 

Based on the design rationales above, the main contributions 
of this paper are summarized as follows: 

• Formulate the spatial reuse of a TV channel between 
licensed DTV services and unlicensed D2D communi¬ 
cations as an optimization problem, where the permit¬ 
ted transmit power for an unlicensed mobile device is 
maximized, subject to i) a peak transmit power budget 
constrained by the hardware of that device, and ii) an 
interference probability threshold for the protection of 
licensed DTV receptions. 

^The transmission range of a DTV transmitter can be up to hundreds of 
kilometers, while the transmission range of a D2D communication link in 
cellular networks can be several meters to hundreds of meters. 



Fig. 1. An illustrative example of the network model used in this paper. 

• Introduce the idea of mobile crowd sensing to collect 
massive spectrum measurements from personal mobile 
devices, with the aid of the existing cellular base stations 
(BS). Each cellular BS is in charge of collecting, pre¬ 
processing, and mining the measurement data of its cell 
and the neighboring cells to form a localized TVWS 
database service for D2D communications in its vicinity. 

• Develop big spectrum data mining algorithms for each 
cellular BS to implement the task of TVWS geolocation 
database building: i) A fast matrix completion algorithm 
for obtaining complete spectrum status from a few known 
measurement samples, ii) a nonlinear support vector 
machine algorithm for performing irregular coverage 
boundary detection of a licensed DTV transmitter, and 
iii) an opportunistic spatial reuse algorithm for each D2D 
communication link to determine its MPEP. 

• Provide in-depth simulations under two critical and rep¬ 
resentative scenarios, which show that compared with the 
state-of-the-art approach, the proposed approach has two 
main advantages: i) improved spatial reuse between large- 
scale DTV services and small-scale D2D communications 
can be obtained; ii) reduced interference to the potential 
DTV receptions could be achieved. 

The reminder of this paper is as follows. Section II presents 
the system model. Section III presents the problem formulation 
and analysis. Section IV provides an overview of the proposed 
approach. Section V details the algorithm designs involved in 
the proposed approach. Section VI performs the performance 
evaluation and Section VII concludes the paper. 

II. System Model 

A. Network Model 

As shown in Fig. we consider a scenario that a DTV 
broadcasting network shares a TV channel with a cellular net¬ 
work. The high-power DTV transmitter has a spatial footprint 
up to hundreds of kilometers and its passive DTV receivers 
can be located anywhere inside its coverage area. In Fig. 
the covered area of the DTV transmitter is shown in shade 
and the uncovered area or TVWS is shown in white. The 
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irregular curve around the DTV transmitter represents the 
ground truth coverage boundary. The dashed circular curve 
denotes a commonly-used simplified coverage boundary with a 
protection region dp. The dash-dotted circular curve represents 
the boundary of a no-talk-region 1^ , which is specified as the 
sum of the DTV protection region and an additional separation 
named keep-out distance that depends on the peak transmit 
power of unlicensed devices. The co-located cellular network 
consists of dozens of inter-connected cells. Each cell has a 
cellular BS located at the center and massive mobile devices 
(e.g., smart phones, tablets, and in-vehicle sensors) randomly 
distributed around it. The mobile devices can operate in two 
communication modes: i) communicate with or through the 
associated cellular BS, and ii) directly communicate to each 
other and bypass the cellular BS. The latter mode, also known 
as D2D communication, is promising when devices are in 
close proximity to each other to provide higher throughput 
and enhance user experiences. 

B. Signal Propagation Model 

At a high level, a radio communication channel can be 
modeled as a multi-scale system with three major dynam¬ 
ics p0| : large-scale distance-dependent path loss L, medium- 
scale shadowing S due to fixed obstacles, and small-scale 
fading F due to destructive interference from multipath effects 
and small scatterers. Denote in dB as the transmit power 
of a transmitter located at position z and the received power 
at position x can be written as 

[dB] = P, - + 5'z^x + (1) 

For the propagation modeling of a wideband TV channel 
(e.g., 6 MHz in US and 8 MHz in UK), the small-scale fading 
P are generally averaged out from multiple measurements as 
suggested in p^ . Consequently, in the following, the main 
task of a signal propagation model is to predict the value of 
LpS in Eq. Q. Dozens of models have been proposed during 
the last 60 years p0| , which can be generally grouped into two 
classes: i) deterministic propagation model^ and ii) statistical 
propagation model^ 

In the following, we adopt a hybridized model, where 

• the deterministic path loss component is given as Pz^x = 
10alogio(<^z^x) + 201ogio(/) + 32.45 as suggested 
in p9| , where a is the path loss exponent depending on 
the specific propagation environment | [39| , dz^x in km 
is the distance between the transmitter and the receiver, 
and / in MHz is the center frequency, 

• and the random shadowing component is modeled as a 

Gaussian variable Pz^x ^ A/’(Pz^x, Notably, in 

this paper, we consider the case that the average shadow¬ 
ing loss Pz^x is environment (or location) dependent, 

^The Longley-Rice irregular terrain model (ITM) is one popular determin¬ 
istic model which takes into account a wide variety of factors such as 
terrain shapes, climactic effects, soil conductivity, earth’s curvature, etc dz) 
One key issue of this model is the high computational complexity. 

^The ITU-R model is a commonly-used statistical propagation model (3^ , 
in which all transmissions are assumed to be omnidirectional and the critical 
parameter is the distance to a DTV transmitter. The main drawback of this 
model is its inability to characterize the local variability in the received signal 
strength caused by irregular terrain effects. 


which can varies from 0 dB to tens of dBs pO} . 
is the shadow spread in dB. 

C. DTV Coverage Model 

Let Pmin denote the minimum required power level that a 
DTV receiver can reliably decode the desired signal from its 
DTV transmitter. The location coverage probability of a DTV 
receiver (located at x) with respect to its DTV transmitter 
(located at xq) is defined as 

P4r-.x = Pr{Pxo^x > (2) 

CFq 

where Q{-) is the standard Gaussian tail function and 
Pxo^x = Pxo - Lxo^x - Pxo^x is the average received 
power at location x. 

Definition 1: For an interested area A, denote a cow as a 
location coverage threshold, the coverage area of a DTV 
transmitter located at xq can be defined as the set of all 
coverage locations, i.e., 

-4x7= {x|P47-^x > *^cov, Vx G .4}, (3) 

or equivalently, 

.4^7= {x|Pxo^x > Pmin, Vx G .4}, (4) 

where Pmin — Pmin ^SxQ^x^ (^cov)* 

On the contrary, the area uncovered by the DTV transmitter 
located at xq, i.e., TV white space in A, can be given as 

^TVWS^ {x|j\,o^x < Vx G .4}. (5) 

Remark 1: Traditionally, to facilitate analysis, the DTV 
coverage area is usually simplified as a disc area with the DTV 
transmitter at the center. For instance, in ED> the effects of 
shadowing component Pxo^x,Vx G A, is neglected and only 
distance-related path loss is considered. In [ [4^ , the shadowing 
component is modeled as a Gaussian variable while the mean 
Pxo^x,Vx G .4 is assumed to be zero everywhere. However, 
in practice, the ground-truth coverage boundary of a DTV 
transmitter should be generally in irregular shape, resulted 
from the location-dependent shadowing variations in natural 
terrain morphology or man-made buildings. These shadowing 
variations can result in opportunities for TVWS exploitation 
that cannot be overlooked for the coexistence between small- 
scale D2D communications and large-scale DTV services. 

Remark 2: Compared with the existing disc models, the 
advantages of integrating location-dependent shadowing into 
the DTV coverage modeling are at least two folds: i) Improved 
spatial reuse can be obtained. Due to the effects of location- 
dependent shadowing, as shown in Fig. there can be some 
areas near the edge of or even inside the traditional disc 
coverage/protected region that are not covered by the DTV 
transmission. In those shadowing areas no DTV reception 
could work properly and thus spatial spectrum holes or 
TVWSs for unlicensed devices can be exploited, ii) Reduced 
interference to the DTV receptions near the edge of the 
ground-truth coverage area could be achieved. As illustrated in 
Fig. there are some areas covered by the DTV transmission 
while excluded by the traditional disc protected region, where 
the D2D communications will cause harmful interference to 
the potential DTV receptions. 
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D. Unlicensed Device Interference Model 


Let /max represent interference tolerance threshold of any 
DTV reception inside the DTV coverage area. The location 
interference probability of an unlicensed device located at 
with respect to a DTV receiver at x G is defined as 


p int 


= Pr{Px. 




T — P 

^ j 'I_^^-^max -^Xi^XN^ 

^ -tmaxj— }•) 


where Px^^x = Px^ - -t-xi^x - is the average 

interference power perceived by the DTV receiver. 

Definition 2: For an interested area A, denote z/int as a 
location interference threshold and the interference area of an 
unlicensed device located at x^ can be defined as the set of 
all interference locations, i.e., 


Ground-truth WCRP: Worst Case DTV Receiver Position 



.4“ = {xlPrL^C, > z.i„t,Vx G .4“;}, (7) 

or equivalently, 

{X|P,, > J^ax.x.^x,Vx G .4-;}, (8) 

where /max,x^^x =/max ^ (^int)“t“-Lx^^xH“^Xi^x- 

Remark 3: Since the locations of DTV receivers are gen¬ 
erally unknown a priori to unlicensed devices, a conservative 
assumption should be adopted fT^ that active DTV receptions 
can be located anywhere inside the DTV coverage area. More¬ 
over, as shown in Fig.[^ in previous studies (see, e.g., p8|), for 
any unlicensed device, a no-talk-region dn is specified as the 
sum of the DTV protection region dp and an additional keep- 
out distance da, which is the interference range that depends 
on the peak transmit power of that device. It is noted that 
the concept of no-talk-region simplifies the system design 
at the expense of significant TVWS loss, especially when 
efficient power control techniques can be used by unlicensed 
devices m- 

III. Problem Formulation and Analysis 

Given the DTV coverage model and the unlicensed device 
interference model established above, the problem of interest 
in this paper is to determine whether a TVWS exists for a 
given D2D communication link in an interested area A, and, 
if exists, its quantification. Technically, the problem can be 
expressed as: for a given unlicensed device located at position 
Xi G A, the objective is to maximize its transmit power P*., 
subject to i) the peak transmit power Ppeak constrained by its 
hardware and ii) the interference probability threshold z^int for 
protecting the licensed DTV receptions, i.e., 

OPl : P*. = maxPx- (9) 

subject to 

Pyii ^ L^peak, (10) 

Prx"Lx < Z^int, Vx G . (11) 

Based on Definition 2, we can combine the two constraints 
in OPl as follows 

Pxi < min{Ppeak 5 /maXjX^^x: Vxg.4^7}. (12) 


Fig. 2. Illustration for the optimal solution to problem OPl. The maximum 
permitted emission powers for device A, device B, and device C are P*. = 
Ppeak (white space), P*. G (0, Ppeak) (gray space), and P^. = 0 (black 
space), respectively. 


To facilitate the analysis, we introduce a notion named 
worst case DTV receiver position (WCRP) for each unlicensed 
device. As shown in Fig. the WCRP for an unlicensed 
device is the location that lies on the boundary of the DTV 
coverage and perceives the strongest interference from that 
device. Now, we can rewrite © as 

/xi ^ rnin{Ppeak5-^max,Xi^xt (13) 

where x^ denotes the WCRP for the device located at x^. 

Intuitively, the optimal solutions to OPl for devices located 
at various locations can be depicted as in Fig. It is noted 
that the optimization problem OPl is quite difficult to tackle 
due to the following concerns: 

(i) The boundary irregularity of DTV coverage area makes 
it mathematically intractable to obtain an optimal solution. If 
the boundary is simplified to be circular as shown in Fig. 

positioning methods can be used to obtain an 'optimaT 
solution as in | [42| . However, in practice, the ground-truth 
boundary is generally in arbitrary shapes and the optimal 
solution relies highly on the precise estimation of a local radio 
environment map | [43| for each unlicensed device, which is a 
nontrivial task for individual devices. 

(ii) Strict interference constraint is required to protect li¬ 
censed DTV receptions while limited spectrum measurements 
are available for each unlicensed device. Collaborative sharing 
of measurement data among neighboring devices is a feasible 
way to improve the estimation of radio environment map 
for all devices 1^ . However, the energy and delay cost 
involved in the information exchange and the deployment cost 
of dedicated sensors may make it commercially unwelcome. 

(iii) Spatial reuse opportunities for unlicensed devices at 
different locations are generally heterogeneous. As pointed out 
in p^ , at a given time, devices (even neighboring devices) 
at various locations may have different MPEPs (see the 
illustrative example in Fig. [^, which makes a global solution 
infeasible. 








5 


These observations above motivate us to propose a techni¬ 
cally feasible solution for the interesting but critical problem 
OPl. Specifically, Section IV provides an overview of the 
proposed approach. Section V details the algorithm designs 
and Section VI presents the performance evaluation. 

IV. Overview of The Proposed Solution 
A. Key Ideas 

One key insight is that, as an alternative to the existing 
propagation model-based geolocation database approach | [27| , 
| [28| , we can build a location-specific TVWS database by using 
active spectrum measurements with the aid of the already- 
deployed cellular infrastructure. Specifically, mobile devices 
equipped with programmable sensors can contribute massive 
location-aware spectrum measurements and each cellular BS 
can collect and process big spectrum data to form a localized 
TVWS database, which in turn provides a spectrum service 
for those mobile devices. Considering the fact that the spatial 
coverage of a DTV transmitter is relatively static, the spectrum 
measurements can be collected and updated in an offline 
and asynchronous manner. Moreover, with a large amount of 
collected spectrum data, it is possible to implement an exten¬ 
sive exploitation of TVWS by exploring intrinsic PHY-layer 
signal propagation characteristics of both distance-dominant 
path loss and location-dependent shadowing effect, which 
will undoubtedly provide more spatial reuse opportunities for 
small-scale D2D communications. In addition, recent advances 
in (big) data mining technologies make the proposed data- 
centric solution technically feasible. 


B. An Overview 

Fig. depicts an overview of the proposed solution, which 
can be implemented in the cellular system without requiring 
any modification to the existing infrastructure. Specifically, the 
proposed approach resides at each cellular BS and consists of 
the following sequential four building blocks: 

• Mobile crowd sensing that collects location-aware spec¬ 
trum measurements from massive personal mobile de¬ 
vices with positioning capability. 

• Matrix completion that recovers unknown spectrum 
data at locations that are lack of measurements based 
on some known spectrum data through effective spatial 
interpolation. 

• Boundary detection that determines the irregular cov¬ 
erage boundary of the DTV transmitter by effectively 
mining the completed big spectrum data. 

• Spatial reuse that enables the unlicensed D2D communi¬ 
cation links to opportunistically share the same spectrum 
with the licensed DTV services. 


V. Algorithm Designs of The Proposed Solution 

In this section, we detail the algorithm design of each 
building block in the proposed solution. 
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Fig. 3. The building blocks of the proposed solution. 


A. Mobile Crowd Sensing for Spectrum Measurements 

Spectrum measurement data is often obtained by deployed 
sensors with specialized equipments. Alternatively, in this 
paper, we introduce a novel information gathering technique, 
namely mobile crowd sensing (MCS) | [45| , to perform spec¬ 
trum measurements with personal devices such as smart¬ 
phones, tablets, and in-vehicle sensors. The key idea is that, 
with massive personal devices, each contribute a small amount 
of sensing data to ultimately obtain a sufficiently large dataset. 

As shown in Fig.j^a), in the proposed MCS-based spectrum 
measurement scheme, each mobile device uses a modern 
'mobile app' to perform spectrum sensing and report the 
sensed DTV signal strength coupled with its current location 
information to its cellular BS through, e.g., a common control 
channel. Each cellular BS uses a local operational database to 
collect and store the spectrum measurement data. 
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Some key issues in MCS-based spectrum measurement 
scheme are discussed as follows. 

1) Trustiness of Contributions: The first issue is how to 
ensure the quality of spectrum data from a large group of low- 
end personal devices that may be unreliable, untrustworthy, or 
even malicious. Specifically, the sensing report from a crowd 
spectrum sensor m at location x can be expressed as: 




P: 


Xo^X 


(14) 


Energy detector output,Abnormal data component 


where is the output of the energy detector including the 
received DTV signal component Watt) and the 

Gaussian noise component with mean Nq and variance 
= (Pxo^x • l{Wi} + A^o)^/W^ 3 am- Notably, the abnormal 
data component is denoted as a^, which is zero if y'^ is a 
normal data and nonzero if y'^ is abnormal. 

To effectively mitigate the uncertainty of sensing data from 
crowd spectrum sensors, in the previous work | [46| , we have 
developed a data cleansing algorithm to robustly cleanse out 
the nonzero abnormal data component from the original cor¬ 
rupted sensing data, and moreover, in p0| we have designed 
a unsupervised clustering algorithm to distinguish spectrum 
attackers from reliable sensors based on dissimilarity analysis 
of historical reports, both of which can be applied to improve 
the quality of crowd spectrum data. 

2) Incentive to Contribute: The second issue is how to 
motivate personal devices to participate in spectrum mea¬ 
surements and contribute the sensed data. While participating 
in a crowd sensing task, mobile devices consume their own 
resources such as battery and computing power, and expose 
themselves to potential privacy threats by sharing their sensed 
data with location information. However, we could consider 
a simple incentive mechanism that devices which contribute 
spectrum data will receive a reward in terms of bandwidth and 
discounted or free communication minutes once TVWS is found 
available. More advanced incentive mechanisms for general 
MCS systems, based on game theory and auction theory, can 
be found in | [47| and more general rules to motivate human 
cooperation can be found in | [48| , | [49| . 

3) Spectrum Data at Each Cellular BS: Another key issue 
is what type of spectrum data should be collected at each 
cellular BS. Briefiy, the spectrum data at each cellular BS can 
be considered as multi-dimensional (i.e., time, frequency, and 
space) spectrum measurements (i.e., the received DTV signal 
power levels) from massive personal devices. Specifically, 
considering the spectrum data in time domain, for a given 
frequency band and a given location, spectrum measurements 
can be collected over multiple days in an offline and asyn¬ 
chronous manner. For the spectrum data in frequency domain, 
spectrum measurements can be from multiple TV channels. 
From the perspective of space domain, spectrum data at each 
cellular BS refers to spectrum measurements from an area of 
interest A, including both its cell (that can be directly collected 
from personal devices in its coverage area) and the neighboring 
cells (that can be obtained from the neighboring cellular BS’s 
via backh^l). To facilitate the following processing, when we 
consider ^ as a Lc x Lc square area as shown in Fig. 



Fig. 4. Area of interest A at each cellular BS. -Rceii is the cellular 
radius, rcov is the coverage range of a mobile device, and rint denotes the 
interference range when the peak transmit power is used by the D2D link. 


the spatial range of A is determined by the cellular radius 
RceW and the worst-case interference range rim of a D2D 
link located at the edge of its cell, that is. 


Lc = 2(i?cell + nnt)- (15) 

4) Uplink Overhead Analysis: The use of mobile crowd 
sensing requires personal mobile devices to send their spec¬ 
trum data to the cellular BS. This will result in additional 
uplink overhead in the cellular network for collecting spectrum 
measurements. Specifically, for a given cell, suppose that 
in a time period T, there are A^ceii spectrum measurements 
collected by that cellular BS from Mceii mobile devices and 
each measurement is of B bits, then the cell average uplink 
overhead is x b average uplink overhead 

for each device is ^^^ bits/s. To decrease the uplink 
overhead, it is straightforward to increase T and Mceii, and 
decrease Vceii and B. Considering the fact that the spatial 
coverage of a DTV transmitter is relatively static, the spectrum 
measurements can be collected, from a variable number of 
mobile devices, in an offline and asynchronous manner over a 
relatively long time period T (e.g., ranging from a few minutes 
to multiple days or even longer). Moreover, the effective 
spatial interpolator developed in the following subsection 
allows us to recover unknown spectrum data from a few known 
spectrum samples, which in essence decreases A'ceii- 


B. Matrix Completion for Unknown Measurements Recovery 

In MCS-based spectrum measurement scheme, the collected 
data at each cellular BS depends highly on individual human 
activities (5T), p^ . Consequently, there may be some loca¬ 
tions or areas that are lack of (reliable) measurements. An 
effective spatial interpolator is vital for recovering unknown 
spectrum data from a few known samples. Most existing 
spatial interpolators can be represented as linear-weighted 
averages of sampled data and the weight designs are generally 
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heuristic, such as the well-known inverse distance weight¬ 
ing 15^ and Kriging weighting | [54| . Many studies in various 
disciplines (see, e.g., | [55| ) have shown that there is no simple 
answer regarding the choice of an appropriate weight design, 
because a method is “best” only for specific situations, such 
as the spatial configuration of the data and the underlying 
assumptions. 

To avoid the need of explicit weights design, in this paper, 
we formulate the problem of unknown measurements recovery 
as a matrix completion problem and propose to apply a 
powerful fixed point continuation algorithm (FPCA) [ [56| to 
reconstruct the unknown elements from its known elements. 
As shown in Fig. I^b), each cellular BS first divides the area 
of interest A into a set of small square grids. In each grid, if 
the number of the collected (reliable) measurements (i.e., the 
sensed DTV signal strengths) is large enough (e.g., >> 10), 
the mean of them can be taken as an estimation of the average 
received DTV signal strength in that grid. Otherwise, that grid 
will be set as ‘unknown’. 

Furthermore, the spectrum dataset at each cellular BS can be 
written as a p x m matrix M, with the entry j denoting the 
sensed average DTV signal strength in the grid. Due 

to the reasons mentioned above, after collecting the spectrum 
measurements for a given time period, the cellular BS can 
only obtain a subset E C [p] x [m] of M’s entries. The partial 
observation of M is defined as a p x m matrix with the 
entry as 


Mij, if (z, j) e E 
0, otherwise. 


( 16 ) 


We shall recover the unobserved elements of M from M^, 
which can be implemented by solving the following nuclear 
norm minimization 



Fig. 5. The spectrum measurements recovery performance under various 
sampling rates. Here the sampling rate refers to the percentage of known 
elements in the spectrum data matrix M. The detailed parameter configuration 
of this simulation can be found in Section VI-A. 


on converge when the following stopping criterion is 
reached [ [56| : 




(19) 


where 11 • 11 i? is the Frobenius norm and is a small positive 
scalar (e.g., 10“^). 

Fig. shows the spectrum measurements recovery perfor¬ 
mance, in terms of recovery root square error (RSE), which 
is defined as 


RSE[dB] = lOlogio (20) 


min r||M| 
mgrpx^ 


5 E 




E 12 


(17) 




where ||M||^ denotes the nuclear norm of matrix M and 
r is a scaling parameter that balances the first term and 
the second term in ini'- For notational simplicity, a linear 
operator V is introduced to select the components E out of a 
p xm matrix and form a vector such that | |7^(M) — M^| ^ = 

We adopt FPCA here to solve GZl' for its outstanding 
performance in fast completion of large-scale matrices. Briefly, 
FPCA is based on the following fixed-point iteration |[56|: 


'yfe = m'' - APyP(M'') - 




/c+1 


= SrAiY’^), 


(18) 


where A is a step size, V* denotes the adjoint of V, and Su{') 
is the matrix shrinkage operator (see Appendix A for details). 

It is observed that the first step of ( p^ is a gradient-descent 
applied to the second term in which thus reduces the 
second term while generally increases the first term (i.e., 
the nuclear norm) in ^Tf) . In contrast, the second step of 


(18) is to reduce the nuclear norm of Y . Iterations based 


where M denotes the recovered spectrum dataset and G 
represents the ground-truth spectrum dataset with the entry 
Gij as the mean received DTV signal strength at the center 
of the (i, j)-th grid. It is observed in Fig. that: 

• The recovery error generally decreases with an increasing 
number of measurement samples in each grid (i.e., Ngam) 
as well as the sampling rate (i.e., the percentage of known 
elements in the total spectrum data matrix M). 

• Smaller grid size or higher spatial resolution yields better 
recovery performance. 

• The recovery RSE approaches -20 dB when the spatial 
resolution is 80 x 80 m and the sampling rate is no smaller 
than 30 %. 

Computational complexity analysis: In each iteration of 
a singular value decomposition (SVD) has to be com¬ 
puted to perform the matrix shrinkage operation, which has 
a high computational cost. Consequently, FPCA uses a rank- 
r approximate SVD to replace the whole SVD, where r is 
the estimated rank. Numerical experiments show that FPCA 
is very robust and not sensitive to the accuracy of the approx¬ 
imate SVDs. Moreover, a continuation strategy is adopted by 
FPCA to accelerate the convergence of ( p^ and reduce the 
entire computation time, which solves a sequence of instances 














of ( p^ , from easy to difficult, corresponding to a sequence of 
large to small values of r |[56|. 


C Support Vector Machine for Coverage Boundary Detection 

Based on a complete matrix of spectrum data, in this 
section, we develop effective algorithms to reliably detect the 
DTV coverage boundary (see Fig. |^c)), which is the key 
information for spatial reuse. 

As mentionedJn Section V-B, for a given cellular BS, the 
area of interest A has been divided into a set of square grids. 
Specifically, the 2-D coordinates of the (i, j)-th grid center is 
denoted as kj := (xi^yj) e A.i ^ ^ 

anc^the corresponding recovered average DTV signal strength 
is Mij. Based on the energy detection, the cellular BS makes 
a binary test hypothesis for each grid as follows: 

Mij I P, (21) 

j )= + l 

where P := Prnin — Sp is the detection threshold, with P^in 
denoting the minimum average received signal power that a 
DTV receiver can reliably decode the desired signal, which has 
been defined in Definition 1, and Sp being an offset parameter 
to compensate the imperfection of recovered spectrum data, 
which is positive for a conservative design to protect licensed 
DTV services and negative for an aggressive design to improve 
the unlicensed spectrum reuse. h{lij) is a binary declaration 
on whether location lij is covered (h{lij) = —1) by the DTV 
transmission or not (h{lij) = +1). 

Due to the hardware constraint of personal devices and radio 
channel randomness, the declarations are generally not error- 
fre^ We, however, have no prior knowledge which decla¬ 
rations are correct. Based on the declarations with potential 
errors, the objective of coverage boundary detection is to find 
a function / with the minimal detection errors, i.e., 

where hf{lij) G {—1,+1} denotes the coverage state at kj 
determined by the boundary function / (i.e., if f{kj) > 
l,hf{k,j) = +1; else if /{kj) ^ -l,hf{lij) = -_1) and 0 
is a binary operator defined as: if hf{lij) = hf{lij)^ 

~ if ® = +1- 

To provide an efficient solution to ( [22| ), support vector 
machine (SVM) | [57| serves as a promising theoretical tool. 
In the following we start with the formulation of the problem 
using simple linear SVM, and then extend to the design of 
nonlinear SVM classifiers by effectively kernelizing the linear 
SVM. 

Specifically, linear SVM attempts to find a separating hy¬ 
perplane (corresponding to a linea^coverage boundary in this 
problem) {wj)pb = 0,/ G A with the largest margin 
satisfying constraints: 

(w, kj) 0 6 > 0, for h{lij) = 01; 

(w, lij) 0 6 < 0, for h{lij) = -1. (23) 

shown in Fig. ic), circles inside the ground-truth coverage and solids 
outside the ground-truth coverage are erroneous declarations. 


where w is a weight vector and b is the intersect. 

The optimal separating hyperplane can be derived by solv¬ 
ing the following optimization problem | [57| : 

1|| ||2 
mm -||w|| 

w,6 2 

subject to h{lij){{wJij) -\-b) > G A. (24) 


To further consider the potential erroneous input (decla¬ 
rations), a regularization parameter C > 0 is introduced 
to balance the tradeoff between the maximization of margin 
width and the penalty to the errors: 


1 P m 

min 

’ i=l j = l 

subject to h{lij){{w, kj) 0 6) > 1 - G A, (25) 


where > 0 is a slack variable to refiect the impact of 
erroneous declarations. Furthermore, by introducing Lagrange 
multipliers aij > 0,i G {1,2, j G {l,2,...,m}, the 


dual form of (25) can be expressed as 

pm ^ p m p m 

2 


Jp I I V Jp I I V Jp I I V 

-- EE“..-^EEEE )^(^/c,s) {^i,j 5 ^k,s 


i=l j=l i=l j=l k=l s=l 

p m 

subject to = 0, 0 < aij < C. (26) 

i=i j=i 


The optimal solution to (26) can be found by a 

quadratic programming solver |58|, and V/ G A, the decision 


function for the linear boundary can be expressed as: 

p m 

f{l) = signC^y2^lj^ikj){l>kj) + ^*)> (27) 

i=i j=i 


where sign(') is the sign function and the threshold 
b^ can be obtained by averaging bk^s = ^(4,s) “ 

Yl'jLi c^i,jHh,j){h,s,k,j) over all locations lk,s ^ A. 

In practice, the ground-truth DTV coverage boundary is 
generally in nonlinear and irregular shape because of the 
signal attenuation resulted from obstructions such as hills or 
buildings p0| . Thanks to kernel trick | [50| , we can derive 
more general nonlinear and irregular boundary functions by 
replacing the inner product operator in the linear SVM with 
an appropriate kernel function k(-) as follows: 


{h,j’)^k,s) ^ ^{h,j^k,s)'^h,j’)^k,s ^ A. (28) 


Among others, the most widely used kernels include the 
(projective) polynomial kernel 

Hk,j,lk,s) = i{k,j,lk,s) + cY, C> 0 ,de N+; (29) 

and (radial basis function, RBF) Gaussian kernel 

Hkj,lk,s) = exp{-\\lij - lk,s\\l/2a‘^), C7 > 0; (30) 

where c, d, and a are kernel parameters, which are generally 
estimated from the training dataset | [50| , | [58| . These kernels 
implicitly map the data in original 2-D space onto a higher¬ 
dimensional feature space, where data are generally more 
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Grid Size (m) 

Fig. 6. The boundary detection performance in settings of various spatial 
resolutions. Each point is obtained by averaging 100 independent runs. In 
each run, uniform sampling is firstly implemented to obtain and matrix 
completion algorithm in Section V-B is then used to obtain M. After that, SVM 
algorithms in Section V-C are used to derive the DTV coverage boundary. 


TABLE I 

System parameters used in simulations 


Parameter 

Value 

Comment 

B 

6 MHz 

Bandwidth of a TV channel 

f 

615 MHz 

Center frequency of the TV channel 


90 dBm 

Transmission power of a DTV transmitter 

{xo,yo) 

(0,0) 

2-D coordinates of the DTV transmitter 

No 

-95.2 dBm 

Noise power 

dp 

134.2 km 

Radius of DTV protection region 


4 

Path loss exponent of DTV transmission 


2.5 

Path loss exponent of D2D transmission 

fj 

5.5 dB 

Shadowing dB-spread 

p . 

^ min 

-92.2 dBm 

Minimum power to decode the desired signal 

-^max 

-98.2 dBm 

Interference power tolerance threshold 

^cov 

0.9 

Location coverage probability threshold 

i^int 

0.1 

Location interference probability threshold 

A 

8 km X 8 km 

Area of interest at each cellular BS 

Lg 

80 m 

Grid size 

sr 

50% 

Sampling rate 

Lt sam 

100 

Number of measurements in each grid 

Pcell 

2 km 

Cellular radius of each cell 

Ppeak 

-10 dBm 

Peak transmit power of a personal device 

Tint 

2 km 

Worst-case interference range of a device 


separable. Consequently, the linear decision boundary function 
in can be extended to a nonlinear one as: 

p m 

f{l) = signC^J2+ f>*)- ( 31 ) 

i=i j=i 


From a statistical point of view, Fig. [^presents the boundary 
detection performance of two representative SVM algorithms 
in terms of successful detection probability, which denotes the 
probability that the grids inside the ground-truth DTV cover¬ 
age area are detected as inside or the grids outside the coverage 
are detected as outside. It is shown in Fig. that: i) Smaller 
grid size or higher spatial resolution yields better detection 
performance; ii) Higher sampling rate generally gives better 
performance; iii) RBF Gaussian SVM algorithm outperforms 
quadratic SVM for its superior capability of characterizing 
irregular boundary. 

Computational complexity analysis: The core of an SVM 
is a quadratic programming (QP) problem as shown in (26) 
and the computational complexity of QP solvers is dataset 
dependent, scaling between 0{N‘^) and 0{N^) where 
N denotes the number of training samples. 


worst-case interference area that is determined by its peak 
transmit power Ppeak^ we say grid li^ is located in a complete 
white space and the device there can transmit in its peak 
transmit power P*. = Ppeak (see, e.g., device A in Fig. 1^; 

Case III: h{lij) = +1 while there is a segment or the 
detected DTV coverage boundary covered by the device’s 
worst-case interference area, we say grid kj is located in 
a gray space and the device there can only transmit with 
the power P*. G (0, Ppeak) (see, e.g., device B in Fig. 1^. 
Furthermore, P*. = /max,xi^xt • denotes the WCRP for the 
device located at x^, which is the location that lies on the DTV 
coverage boundary and perceives the strongest interference 
from that device. 

Remark 4: In summary, based on the designed algorithms in 
Section IV, each cellular BS can compute and store the MPEPs 
for all locations inside its cell, which form a localized TVWS 
database. Any D2D communication that needs to transmit in an 
unlicensed TV spectrum band can submit an inquiry associated 
with its current location to its cellular BS. After receiving 
the inquiry request, the cellular BS makes a table look-up 
operation in its database and sends back the information of 
the MPEP that device can use without disrupting the normal 
operation of the licensed DTV services. 


D. Opportunistic Spatial Reuse between a D2D Communica¬ 
tion Link and the DTV Services 

As shown in Fig. |^d), when the detected DTV coverage 
boundary is obtained, each cellular BS can compute the MPEP 
for any location inside its cell. Specifically, for any device 
located at (inside the (i, j)-th grid), an opportunistic spatial 
reuse algorithm is proposed as follows: 

Case I: If h{lij) = —1, we say grid Uj is covered by the 
DTV or located in a black space and thus no transmission is 
allowed to protect the potential DTV receptions, i.e., P^. = 0 
(see, e.g., device C in Pig. [^; 

Case II: h{lij) = +1 and there is no point of intersection 
between the detected DTV coverage boundary and the device’s 


VI. Pereormance Evaluation 
A. Simulation Setup 

We consider a wireless environment that a cellular network 
of small-scale mobile devices opportunistically and spatially 
reuse a TV channel licensed to a large-scale DTV system. 
Table I lists the system parameters used in the following 
simulations. The parameters of the DTV system and D2D 
communications are mainly based on the specifications in |T^ 
and |[8|, p^ , respectively. The grid size, sampling rate, and 
number of measurements in each grid are based on the results 
obtained in Section V-B and V-C. _ 

Por each cellular BS, the area of interest A can be located 
i) completely inside the DTV coverage, or ii) completely 
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(a) Ground truth coverage (b) After random sampling (c) After matrix completion (d) After SVM 


Fig. 7. Simulation results of the proposed approach under Scenario I (Part I). The shaded disc area in subfigure(a) and subfigure(d) (around the centered 
cellular BS) represents the cell range of the cellular BS. 


outside the DTV coverage, or hi) partially inside partially 
outside the DTV coverage. Technically, the third case is much 
more challenging to enable the co-channel deployment of a 
DTV system and D2D communications, since the interference 
management in this case is nontriviaQ Consequently, in the 
following simulations, we will focus on two representative 
scenarios of the third case: 

Scenario I: The cellular BS is located at the edge of the 
DTV protection region, which is also named as the noise- 
limited coverage contour in GZ) |T^ . In this scenario, any 
D2D link located inside the DTV protection region is not 
allowed to transmit, while D2D links located outside can be 
allowed to transmit without causing harmful interference to the 
potential DTV receptions inside the DTV protection region. 

Scenario II: The cellular BS is located inside the DTV 
protection region, however, a portion of its cell falls into 
a shadowing zone resulted from large obstacles such as 
mountains or a group of buildings. In this scenario, DTV 
receptions cannot work in the shadowing zone, which thus 
forms white space for small-scale D2D communications. Any 
D2D link located inside the shadowing zone should be allowed 
to transmit without introducing harmful interference to the 
DTV receptions outside the shadowing zone. 

B. Simulation Results under Scenario 1 

Under the representative simulation Scenario I, Fig. |7] 
presents the results to show the role of each building block 
algorithm of the proposed approach. In this simulation, we 
first generate the ground-truth DTV coverage (see Fig. |7ja)) 
as the baseline reference. A cellula^BS is assumed to be at the 
center of the 8 km x 8 km area A, which is further divided 
into 100 X 100 small grids. It appears that the ground-truth 
DTV coverage boundary (separating the blue and red areas) is 
almost linear since the cell (the shaded disc area) in Scenario 
I is located at the edge of the DTV coverage and the range of 
the DTV coverage is much larger than the range of the cell. 

Then, mobile crowd sensing is used at the cellular BS 
to collect spectrum samples and only 50% locations are 
considered to be sampled in Fig. |7jb). Intuitively, the DTV 
coverage is quite difficult to be recognized from the highly 

is noted that interference management for D2D communications in TV 
spectrum is more challenging than in cellular spectrum, since the explicit 
coordination/cooperation from the licensed DTV users is unavailable. 


uncompleted and noisy spectrum data. To tackle this challenge, 
the matrix completion algorithm developed in Section V-B is 
used to recover the unknown measurements (see Fig. |^c)). 
It is shown that the recovered DTV coverage in Fig. [TJc) 
becomes more recognizable than that in Fig.[7jb), however, the 
boundary separating the recovered blue and red areas is still 
quite blur. Therefore, the SVM algorithm developed in Section 
V-C is further used to derive the DTV coverage boundary (see 
Fig.jTJd)). Through comparing Fig.|7Jd) and Fig.|7Ja), it seems 
the proposed approach performs quite well. 

Furthermore, Fig. [^presents additional results to show the 
effectiveness of the proposed approach over the traditional 
model-based approach p7| , | [4^ . Specifically, considering all 
the locations inside the cell (i.e., the shaded disc area in 
Fig. [TJa)) of the centralized cellular BS, from the statistical 
view in terms of empirical cumulative distribution functions 
(CDFs), Fig. [^a) shows the distribution of the derived MPEPs 
at various locations minus the corresponding ground-truth 
MPEPs, and Pig. 8(b) shows the distribution of the derived 
interference probabilities (IPs) at various locations minus the 
IP threshold ^nt = 0.1. 

It is shown in Pig. [8] that: 

• In the traditional model-based approach, the biases of 
MPEPs range from -0.08 to 0.1 and the biases of IPs 
range from -0.1 to 0.9, and the distributions of both of the 
biases are closely related to the specific localization error, 
which is a key parameter to limit the performance of the 
model-based approach. According to p7| , the simulation 
setup of the maximum localization error is 50m for GPS, 
150m for Wi-Pi, and 1000m for GSM-based localization, 
respectively. 

• Relatively, the proposed approach has much smaller 
ranges for both of the biases of MPEPs and the biases 
of IPs, and the maximum of both of the biases can 
be smaller than 0 by adjusting the offset parameter 5p 
(defined in Eq.([^), which means that the proposed 
approach can successfully enable D2D communications 
in TVWS while satisfying the interference constraint 
from the licensed DTV services. Moreover, as mentioned 
in Section V-III, 6p is 3. parameter to compensate the 
imperfection of recovered spectrum data. Generally, a 
larger 6p corresponds to a more conservative design, 
which means the derived MPEPs are much smaller than 
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(a) Distribution of biases of MPEPs. (b) Distribution of biases of IPs. 


Fig. 8. Simulation results under Scenario I (Part II). Considering all the locations inside the cell (i.e., the shaded disc area in Fig. ^a)) of the centralized 
cellular BS, from the statistical view, the left subfigure shows the distribution of the derived MPEP minus the corresponding ground-truth MPEPs, and the 
right subfigure shows the distribution of the derived interference probabilities minus the interference probability threshold z/int =0.1. 



116 117 118 119 120 121 122 123 


x(km) 

(a) Ground truth coverage (b) After random sampling (c) After matrix completion (d) After SVM 





Fig. 9. Simulation results of the proposed approach under Scenario II (Part I). The shaded disc area in subfigure(a) and subfigure(d) (around the centered 
cellular BS) represents the cell range of the cellular BS. 


the ground truth ones and the corresponding IPs are much 
smaller than the threshold z/int = 0.1. 

In brief, compared with the traditional model-based ap¬ 
proach, the advantages of the proposed approach are mainly 
twofold: i) improved spatial reuse between large-scale DTV 
services and small-scale D2D links can be obtained; ii) re¬ 
duced interference to the DTV receptions near the edge of the 
ground-truth coverage area could be achieved. 


C. Simulation Results under Scenario II 


Fig. and Fig. 10 shows the simulation results under 
Scenario II. As shown in Fig. [^a), in this simulation, we 
consider the case that a cellular BS located at (119.2,0), which 
is inside the DTV protection region. The shaded disc area 
in Fig. I^a) (around the centered cellular BS) represents the 
cell range of that cellular BS. There is a shadowing zone 
due to signal attenuation resulted from large obstacles. As an 
illustrative instance, the signal attenuation from x = —116.2 
km to X = —120.2 km linearly decreases from -20 dB to 0 
dB. Comparing with the Scenario I, this scenario is much more 


complex since the TVWS is located inside the DTV protection 
region and the coverage boundary is much more irregular. 
Intuitively, through comparing Fig.|^d) and Fig.|^a), it seems 
the proposed approach performs well. 

Moreover, it can be observed in Fig, that: 

• In the traditional model-based approach, the biases of 
IPs for all setups of the localization error are -0.1, 
which means that all the derived IPs in the model-based 
approach is zero, considering the interference probability 
threshold = 0.1. In other words, all the derived 
MPEPs are zero and no transmission is allowed in this 
scenario, which results in a waste of spatial reuse oppor¬ 
tunities of TVWS for D2D communications. 

• Differently, the proposed approach can enable D2D com¬ 
munications in this kind of TVWS while satisfying the 
interference constraint from the licensed DTV services at 
more than 90% locations inside the cell for all the three 
setups of the offset parameter 5p (defined in Eq.([2T])). 
Eurthermore, a larger dp corresponds to smaller inter¬ 
ference to potential DTV receptions and larger biases of 
MPEPs. Under this simulation scenario, = 3 is a better 
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Model-based approach 



Model-based approach 


Q 

o 

0.5 
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- A - Loc-error = 1000m 


O'— 

-1.5 


-0.5 0 

Bias of interference probability 
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Fig. 10. Simulation results of Scenario II (Part II). 


choice to balance the tradeoff between the protection to 
DTV receptions and the spatial reuse opportunities of 
D2D communications. 

To our best knowledge, this is the first try to explore and 
exploit TVWS inside the DTV protection region. Potential ap¬ 
plication scenarios include (but not limited to) communications 
between internet of vehicles in the underground parking, D2D 
communications in hotspots such as subwaym game stadiums, 
and airports, etc. 

VII. Related Work 

During the past few years, the idea of enabling D2D 
communications in cellular networks for handling local traffic 
has gained growing attention. The prior studies in 
have shown that better resource utilization can be achieved by 
non-orthogonal spectrum sharing between D2D communica¬ 
tions and cellular networks. Among many others, the authors 
of fTT|-fT4]| have proposed various interference management 
schemes to coordinate D2D and cellular users for achieving 
improved spatial reuse of the cellular spectrum. The work 
of this paper is complementary to prior studies and the 
goal here is to explore unlicensed TV spectrum for D2D 
communications with the assistance from cellular networks. 
The sharing of TV spectrum between D2D and DTV users 
brings unique challenges, mainly due to the lack of explicit 
signaling cooperation. 

The idea of opportunistic access of TVWS for cellular 
networks has also received increasing interests. A related work 
in p4| proposed a spectrum sensing-based mechanism to 
explore TVWS for cellular users, where a simplified circular 
DTV coverage model was assumed and collaborative sensing 
was done among neighboring cellular BSs with fixed topology. 
In contrast, our proposed approach introduces mobile crowd 
sensing to collect spectrum measurements from massive per¬ 
sonal devices and exploits TVWS in a much finer granularity 
by considering the practical irregular DTV coverage. 


Based on the guideline provided by FCC | [2l] |, several 
TVWS database systems have been developed by compa¬ 
nies and very recently, many of them have entered 

the phase of public testing Technically, in p7| , the au¬ 
thors presented a database system, named SenseLess, in which 
unlicensed devices rely on a geolocation database service to 
determine TVWS availability without spectrum sensing. One 
common feature of these database systems is that they are 
model-based approaches in essence, since the database service 
is provided by using a combination of sophisticated signal 
propagation modeling, fine terrain data, and an up-to-date 
parameters of the DTV transmitters. Alternatively, this paper 
presents a data-driven approach to build a database by learning 
the TVWS availability from big spectrum measurement data. 

As opposed to propagation model estimates, there are few 
related work that use actual measurements to build spatial 
TVWS maps or database. One representative research work 
has been done by the European project FARAMIR (during 
2010-2012) (see, e.g., p5]-|[^), where extensive spectrum 
measurements have been conducted at several locations in 
Europe to provide a valuable basis for spectrum use modeling 
in time, frequency, and space, and to increase the radio envi¬ 
ronmental and spectral awareness of future wireless systems. 
In p9| and pQ| , a large set of active measurements have 
been collected to evaluate the accuracy of propagation models 
in making radio link predictions, where a conclusion has 
been reached that these models can be used for nationwide 
coverage planning, but perform poorly at predicting accurate 
path loss even in relatively simple outdoor environments 
and more complex models that consider a larger number of 
variables (e.g., terrain, climactic, soil conductivity, etc) do not 
necessarily make better predictions. These studies reinforce 
the motivation of this paper, which extends those studies by 
developing effective data mining algorithms to build TVWS 
database from actual measurements. 
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VIIL Conclusions 

In this paper, we proposed a mobile crowd sensing-driven 
geolocation spectrum database for device-to-device communi¬ 
cations (D2D) in TV white space. This paper represents the 
first try to highlight that both the distance-based path loss 
and the location-dependent shadowing effect can improve the 
isolation between the small-scale D2D communications and 
large-scale DTV services, which increases the flexibility in 
local spectrum usage and benefits the D2D communications 
by providing more TVWS opportunities. We formulated the 
spatial reuse of a TV channel between licensed DTV services 
and unlicensed D2D communications as an optimization prob¬ 
lem. To obtain an effective solution, we presented a systematic 
approach consisting of mobile crowd sensing for spectrum 
measurements collection, fast matrix completion algorithm 
for unknown measurements recovery, nonlinear support vector 
machine algorithm for irregular coverage boundary, and op¬ 
portunistic spatial reuse algorithm for deciding the maximum 
permitted emission power (MPEP) for each D2D link. One 
interesting but critical research direction is to investigate the 
coexistence of multiple co-channel D2D links, where mutual 
interference exists among different D2D links and the total 
interference reaching a TV receiver is the cumulative sum of 
received signal power from all D2D links in its vicinity. An¬ 
other future work is to consider adjacent-channel interference 
limit required by licensed DTV services, where the correlation 
in frequency domain and the channel frequency selectivity as 
studied in should be carefully investigated. 

Appendix A 

Matrix Shrinkage Operator Su{') 

Assume M G and its singular value decomposition is 

given by M = Udiag(cr)V^, where U G G and 

V G Given a scalar > 0, is defined as 

S,{M) ■- Udiag(s,((T))vT (32) 

with the vector Sjy{a) defined as 

Su{a) := max{cr — z/, 0}, component wise. (33) 

Simply speaking, Sjy(cr) reduces every nonnegative singular 
value of M by u; if any singular value is smaller than u, it is 
reduced to zero. 
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